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Input file Fbh56919FL2 .seg 
Sequence length 3003 

TTCGGCACCAGGCTGCTGCGGGGGGACTCTTTCTGAGGTTACTGTGGAGCACCCAAAGTCTGTCAGCCTCTGGCCGTGC 

AAACAGGCACCCAGAGGAACCAGACCTTGCTTATTCACCCACAGCCTGGGACTGTCTTCTCCAGAGTCTCCATCAGCTT 

TGCTAATCGACTGATTGGAAATAATTCCTCAAACACCACCAAGTCAAGGATACAGGCAGCAGCGGCTCCCCTGTTGTAT 

GGACATTCTGCACCCGAAACTGATAGCTGAGTCCTGAAGTTTTATGTTATGAAACAGAAGAACTTTCATCCqXGCACAT 

MDESALTLGTIDV 13 
GATTTGGGAATTACACTTTGTGAC ATG GAT GAA TCT GCA CTG ACC CTT GGT ACA ATA GAT GTT 39 
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ISYDRIIEGHY 
ATC TCC TAT GAT CGC ATT ATC GAA GGT CAC TAC 

KNESLWSVARG 
AAG AAT GAG AGC CTG TGG AGT GTA GCA AGA GGT 

GCVRVDFAQPF 
GGT TGT GTC CGA GTG GAT TTT GCA CAG CCA TTT 

SQKPVSALLSL 
AGT CAG AAA CCG GTG TCT GCT CTA CTT TCC CTG 



PSRPSDAADEG 
CCT TCA AGA CCC AGT GAT GCT GCT GAT GAA GGT 

NATDESLRRRL 
AAT GCA ACA GAT GAA TCC CTA CGA AGG AGG TTG 

F I'ir ASKSCAIMS 
TTC 'ACT GCT AGC AAG TCC TGT GCC ATT ATG TCC 

Y lUR HRQGIDLST 
TAC|p|GA CAC AGG CAG GGA ATT GAT CTC TCC ACA 

ffi 

E 'ZE VLARDFDLG 
GAG ; GAA GTC CTG GCT CGT GAT TTT GAC CTG GGG 

M r HAIQLLGNCV 
ATG: CAT GCC ATA CAG CTG CTG GGA AAT TGT GTC 

£ I -J p p x T P S T T V P 
GAG: TTT TTT ATC ACC CCC AGC ACA ACT GTC CCA 

N i. G V L H V F I M E A 
AAT! GGG GTA CTT CAT GTC TTT ATC ATG GAG GCC 

LNKRGL.GGPTS 
CTG AAC AAG AGG GGA CTG GGG GGT CCC ACT AGC 

QLVRKAASLCY 
CAG CTG GTG CGG AAG GCG GCC AGC CTG TGC TAC 

LPCQTFYQVCH 
CTG CCT TGC CAG ACA TTT TAC CAA GTC- TGC CAT 

G-ILTVAEHDDQ 
GGC ATT CTT ACA GTG GCA GAG CAC GAT GAC CAG 

QQ WDKKLPEPL 
CAG CAG TGG GAC AAG AAG CTT CCA GAA CCT TTG 

DSDFGEEQRDC 
GAC AGT GAC TTT GGG GAG GAA CAG CGA GAT TGC 

HQQFITFLQRL. 
CAC CAG CAG TTT ATC ACC TTC TTA CAG AGA CTC 

SAAIFVHNFSG 
TCT GCT GCC ATC TTT GTT CAC AAC TTC AGT GGT 

KLHKYLITRTE 
AAG TTG CAC AAA TAC CTA ATA ACC AGA ACA GAA 
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ATYCLVKNAVKMFKD IGVFK 793 

GCC AC A TAT TGT CTT GTG AAG AAT GCT GTG AAA ATG TTT AAG GAT ATT GGG GTT TTC AAG 2379 

ETKQKRVSVLELSSTFLPQC 813 

GAG ACC AAA CAA AAG AGA GTG TCT GTT TTA GAA CTG AGC AGC ACT TTT CTA CCT CAA TGC 2439 

NRQKLLEYILSFVVL * 829 

AAC CGA CAA AAA CTT CTA GAA TAT ATT CTG AGT TTT GTG GTG CTG TAG 2487 

GTAACGTGTGGCACTGCTGGCAAATGAAGGTCATGAGATGAGTTCCTTGTAGGTACCAGCTTCTGGCTCAAGAGTTTGA 

AGGTGCCTTCGCAGGGGTCAGGCCTGCCCTGTNCCGAAGTGATCTCCTGGAAGACAAGTGCCTTCTNCCTCCATGGATC 



TGAGATCTTCCCAGCTTT 
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Protein Family / Domain Matches, HMMer version 2 

Searching for complete domains in PFAM 

hmmpfam - search a single seq against HMM database 

HMMER 2.1.1 (Dec 19 98) 

Copyright (C) 1992-1998 Washington University School of Medicine 

HMMER is freely distributed under the GNU General Public License (GPL) . 



HMM file: /prod/ddm/seqanal/PFAM/pf am6 . 4 /Pf am 

Sequence file: /prod/ddm/wspace/orfanal/oa-script .21255 . seq 



Query: 56919 

Scores for sequence family classification (score includes all domains) : 
Model Description Score E-value N 



Acyltransf erase Acyl transferase 126.1 6.4e-34 



Parsed for domains : 

Model Domain seq-f seq-t hmm-f hmm-t score E-value 



Acyltransferase 1/1 215 412 . . 1 195 [] 126.1 6.4e-34 

Alignments of top-scoring domains: 

Acyltransferase: domain 1 of 1, from 215 to 412: score 126.1, E = 6.4e-34 
*->lenlpkkgpaiwsNHrSylDilvlsaalprrgpwlvrrlvf iakke 
+ +++++ p ++ + HrS++D+l+l ++1++++ ++ +ia ++ 

56919 215 KAATETNLPLLFLPVHRSHIDYLLLTFILFCHN IKAPYIASGN 257 



llkvPllfGwlmrlagaif idRnnra kdalaaadelvrvlellrk 

+1++P+ f++l+ ++g +fi+R+ +++++++kd l++a+ + + +ellr+ 
2 58 NLNIPI - FSTLIHKLGGFFIRRRLDEtpdgrKDVLYRALLHGHI VELLRQ 3 06 

grsvliFPEGTRsrsgellppfKkGia af rlAlkagvpivPwiv 

+ + iF EGTRsrsg++ + ++G+++ + + ++ ++ i+Pv+i 
307 QQFLEIFLEGTRSRSGKTSC- ARAGLLsvwdTLSTNVIPDILI IPVGI - 354 

sgteelepkneagklLrlarkkgpvtvrvlppipld. . pedikelaerlr 

355 SYDRI IEGHYNGEQ — LGKPKKNESLWS VARGVIRMlrKNYGCVRVDFAQ 402 
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10 20 30 40 

1 MDESALTLGTIDVSYLPHSSEYSVGRCKHTSEEWGECGFR 56919. pro 

1 MEESSVTVGTIDVSYLPSSSEYSLGRCKHTSEDWVDCGFK MouseGPAT . PRO 

1 MEESSVTIGTIDVSYLPNSSEYSLGRCKHTNEDWVDCGFK RatGPAT . PRO 

1 1 1 r 

50 60 70 80 

41 PTVFRSATLKWKESLMSRKRPFVGRCCYSCTPQSWDKFFN 56919. pro 
41 PTFFRSATLKWKESLMSRKRPFVGRCCYSCTPQSWERFFN MouseGPAT . PRO 
41 PTFFRSATLKWKESLMSRKRPFVGRCCYSCTPQSWERFFN RatGPAT. PRO 

1 1 1 r 

90 100 110 120 

81 PSIPSLGLRNVIYINETHTRHRGWLARRLSYVLFIQERDV 56919. pro 
81 PSIPSLGLRNVIYINETHTRHRGWLARRLSYILFVQERDV MouseGPAT . PRO 
81 PSIPSLGLRNVIYINETHTRHRGWLARRLSYILFVQERDV RatGPAT . PRO 

1 1 1 — r- 

130 140 150 160 
pi 1 l_ I 1_ 

121 H pfiG MFATNVTENVLNSSRVQEAIAEVAAELNPDGSAQQQS 56919 .pro 
121 H ffc MFATSVTENVLSSSRVQEAIAEVAAELNPDGSAQQQS MouseGPAT . PRO 
121 H jjfjG MFAT S ITDNVLN S SRVQEAIAEVAAELN PDGSAQQQS RatGPAT . PRO 



llJ 170 180 190 200 
I I 1 L. 

161 K |C-V NKVKKKAKRILQEMVATVSPAMIRLTGWVLLKLFNSF 56919. pro 
161 KA"IQKVKRKARKILQEMVATVSPGMIRLTGWVLLKLFNSF MouseGPAT . PRO 
161 K j^j I Q K V K R K A R K I L Q E M V A T V S P G M I R L T G W V L L K L F N S F RatGPAT . PRO 



h! 210 220 230 240 
IZ I I I 1_ 

201 Fi^NIQIHKGQLEMVKAATETNLPLLFLPVHRSHIDYLLLT 56919. pro 
201 F©NIQIHKGQLEMVKAATETNLPLLFLPVHRSHIDYLLLT MouseGPAT . PRO 
201 Fl#NIQIHKGQLEMVKAATETNLPLLFLPVHRSHIDYLLLT RatGPAT. PRO 

1 1 1 r 

250 260 270 280 

241 FILFCHNIKAPYIASGNNLNIPIFSTLIHKLGGFFIRRRL 56919. pro 
241 FILFCHNIKAPYIASGNNLNIPVFSTLIHKLGGFFIRRRL MouseGPAT . PRO 
241 FILFCHNIKAPYIASGNNLNIPIFSTLIHKLGGFFIRRRL RatGPAT . PRO 



290 300 310 320 

281 DETPDGRKDVLYRALLHGHIVELLRQQQFLEIFLEGTRSR 56919. pro 
281 DETPDGRKDILYRALLHGHVVELLRQQQFLEIFLEGTRSR MouseGPAT . PRO 
281 DET PDGRKDILYRALLHGHIVELLRQQQFLEI FLEGTRSR RatGPAT . PRO 



330 340 350 360 
I I I L_ 

321 SGKTSCARAGLLSVVVDTLSTNVIPDILIIPVGISYDRI I 56919. pro 

321 SGKTSCARAGVLSVVVNTLSSNTIPDILVIPVGISYDRII MouseGPAT . PRO 

321 SGKTSCARAGLLSVVVDTLSSNTIPDILVIPVGISYDRII RatGPAT . PRO 
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370 380 390 400 
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361 


EGHYNGEQLGKPKKNESLWSVARGVIRMLRKNYGCVRVDF 56919. pro 
EGHYNGEQLGKPKKNESLWSVARGVIRMLRKNYGYVRVDF MouseGPAT . PRO 
EGHYNGEQLGKPKKNESLWSVARGVIRMLRKNYGYVRVDF RatGPAT . PRO 
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AQPFSLKEYLESQSQKPVSALLSLEQALLPAILPSRPSDA 56919. pro 
AQPFSLKEYLEGQSQKPVSAPLSLEQALLPAILPSRPNDV MouseGPAT . PRO 
AQPFSLKEYLEGQSQKPVSAPLSLEQALLPAILPSRPDAA RatGPAT. PRO 
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ADEGRDTSINESRNATDESLRRRLIANLAEHILFTASKSC 56919. pro 
ADEHQDLSSNESRNPADEAFRRRLIANLAEHILFTASKSC MouseGPAT . PRO 
AAEHEDMSSNESRNAADEAFRRRLIANLAEHILFTASKSC RatGPAT . PRO 
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A*! MSTHIVACLLLYRHRQGIDLSTLVEDFFVMKEEVLARD 56919. pro 

A 'I MSTHIVACLLLYRHRQGIHLSTLVEDFFVMKEEVLARD MouseGPAT . PRO 

Ajl MSTHIVACLLLYRHRQGIHLSTLVEDFFVMKEEVLARD RatGPAT . PRO 

-ifl 1 1 1 
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F,'g LGFSGNSEDVVMHAIQLLGNCVTITHTSRNDEFFITPS 56919. pro 
fQ> LGFSGNSEDVVMHAIQLLGNCVTITHTSRKDEFFITPS MouseGPAT . PRO 
F; J D LGFSGNSEDVVMHAIQLLGNCVTITHTSRKDEFFITPS RatGPAT . PRO 
!" : 




sT\ 570 580 590 600 


561 
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lf'*F VPSVFELNFYSNGVLHVFIMEAIIACSLYAVLNKRGLG 56919. pro 
TlSvPSVFELNFYSNGVLHVFIMEAIIACSIYAVLNKRCSG MouseGPAT . PRO 
Tj-f-T VPSVFELNFYSNGVLHVFIMEAIIACSIYAVQNKRGSG RatGPAT . PRO 
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GPTSTPPNLISQEQLVRKAASLCYLLSNEGTISLPCQTFY 56919. pro 
GSAGGLGNLISQEQLVRKAASLCYLLSNEGTISLPCQTFY MouseGPAT . PRO 
GSAGGLGNLISQEQLVRKAASLCYLLSNEGTISLPCQTFY RatGPAT. PRO 
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641 
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QVCHETVGKFIQYGILTVAEHDDQEDISPSLAEQQWDKKL 56919. pro 
QVCHETVGKFIQYGILTVAEQDDQEDVSPGLAEQQWDKKL MouseGPAT . PRO 
QVCQETVGKFIQYGILTVAEQDDQEDVSPGLAEQQWNKKL RatGPAT . PRO 




1 1 — i r 

690 700 710 720 



681 PEPLSWRSDEEDEDSDFGEEQRDCYLKVSQSKEHQQFITF 56919. pro 

681 PE-LNWRSDEEDEDSDFGEEQRDCYLKVSQSKEHQQFITF MouseGPAT . PRO 

681 PEPLNWRSDEEDEDSDFGEEQRDCYLKVSQAKEHQQFITF RatGPAT . PRO 
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750 760 



721LQRLLGPLLEAYSSAAIFVHNFSGPVPEPEYLQKLHKYLI 56919. pro 

720 LQRLLGPLLEAYSSAAIFVHNFSGPVPESEYLQKLHRYLI MouseGPAT . PRO 

721LQRLLGPLLEAYSSAAIFVHTFRGPVPESEYLQKLHRYLL RatGPAT . PRO 



780 790 800 



761TRTERNVAVYAESATYCLVKNAVKMFKDIGVFKETKQKRV 56919. pro 
760 TRTERNVAVYAESATYCLVKNAVKMFKDIGVFKETKQKRV MouseGPAT . PRO 
761TRTERNVAVYAESATYCLVKNAVKMFKDIGVFKETKQKRA RatGPAT . PRO 



801 SVLELSSTFLPQCNRQKLLEYILSFVVL 56919. pro 

800 SVLELSSTFLPQCNRQKLLEYILSFVVL MouseGPAT . PRO 

801 SVLELSTTFLPQGSRQKLLEYILSFVVL RatGPAT. PRO 



Acyltransf erase catalytic motif-I 



I FLEGTRSR 
I FLEGTRSR 
I FLEGTRSR 
YFVEGGRSR 



56919. pro 
MouseGPAT . PRO 
RatGPAT . PRO 
EcoliGPAT. PRO 



Acyltransf erase catalyic motif-II 



H R S H I D 56919. pro 

H R S H I D MouseGPAT . PRO 

H R S H I D RatGPAT. PRO 

H R S H M D EcoliGPAT . PRO 



Acyltransf erase catalytic motif-III 



I L I I P v 

i l v i p v 

I L V I P V 

I T L I P I 



56919. pro 
MouseGPAT . PRO 
RatGPAT. PRO 
EcoliGPAT. PRO 



Acyltransf erase signature motif 



G G F F I R R 56919. pro 

G G F F I R R MouseGPAT. PRO 

G G F F I R R RatGPAT. PRO 

G A F F I R R EcoliGPAT. PRO 
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